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Abstract 


Artificial intelligence (Al) is rapidly transforming the global financial services industry. As a group of 
related technologies that include machine learning (ML) and deep learning (DL), Al has the potential 
to disrupt and refine the existing financial services industry. | review the extant academic, practitioner 
and policy related Al literature. | also detail the Al, ML and DL taxonomy as well as their various 
applications in the financial services industry. 


A literature survey of Al and financial services cannot ignore the econometric aspects and their 
implications. ML methods are all about algorithms, rather than asymptotic statistical processes. Unlike 
maximum likelihood estimation, ML’s framework is less unified. To that end, | will discuss the ML 
approaches of unsupervised and supervised learning. 
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“Al is the ‘new electricity’ ... just as electricity 
transformed many industries roughly one 
hundred years ago; Al will also now change 


every major industry.” 
Andrew Ng, 2007 


“What we're seeing is something 
unprecedented, which is the arrival of artificial 
intelligence, which has a big impact... it creates 
tremendous uncertainty and impacts different 
people differently ... and some people could be 
left out." 


Robert Shiller, 2018 Davos Forum 


1. Introduction 


In 1950, Alan Turing posed the question “Can machines think?” and since then artificial intelligence 
(hereafter known as Al) applications have met with varying degrees of success. However, in recent 
years there has been a resurgence of interest and Al has found innovative applications in the global 
financial services industry. The availability of big data, improved technology, cloud computing and 
faster special purpose hardware have been key drivers of the latest Al innovation wave. Al capabilities 
and machine learning (ML) are boosting growth in the emerging Fintech market. Broadly speaking, the 
term “Fintech” describes the new technologies, services and companies that have changed financial 
services. It includes (but is not limited to): cryptocurrencies, blockchain1, robo-advising, smart 
contracts, crowdfunding, mobile payments and Al platforms. In 2017 Al topped the list as a key trend 
in financial services and Fintech (Future Today Institute, 2017). 


In this literature review, | will detail the Al, ML and deep learning (DL) taxonomy as well as their various 
applications in the financial services industry. | will summarise the current academic, practitioner and 
policy related Al literature. This includes drawing upon economic, finance and computer science 
literature as well as regulatory publications. | specifically discuss four ways in which Al is changing the 
financial services industry: (1) fraud detection (how Al is used to keep criminal funds out of the financial 
system); (2) banking chatbots; (3) algorithmic trading and (4) regulatory and policy aspects. 


Professor John McCarthy coined the term “artificial intelligence” in 1955 and the term “machine 
learning” was coined in 1959 by Arthur Samuel of IBM. Al can mimic actions it has seen or previously 
have been taught about without any new intervention. ML is defined as a particular approach to Al able 
to take the data and algorithms and apply it to new scenarios and patterns without being programmed 
directly. Deep learning (DL) is viewed as a branch of ML. DL provides machines with algorithms 
necessary to understand the underlying principles of an action and significant portions of data. They 
can then be combined to learn on their own and deepen the knowledge and skills with which they are 
provided. High frequency trading (HFT) and algorithmic trading use high speed communications and 
computer programs in the financial services industry. For at least a decade banks have been using ML 
to detect credit card fraud. 


The UK Financial Conduct Authority (FCA) is utilising ML to help individuals manage their current 
accounts. Approximately 9% of all hedge funds use ML to build large statistical models. In 2016, Aidyia 
launched an Al hedge fund to make all its stock trades. Sentient Investment Technologies uses a 
distributed Al system and DL as part of its trading and investment platform. Fukoku Mutual Life 
Insurance uses IBM's Watson Explorer Al to calculate pay-outs. Feedzai uses ML to detect fraudulent 
transactions. UK PropTech? start-up Leverton applies Al to automatically identify, extract and manage 
data from corporate documents such as rental leases. In October 2017, exchange traded funds (ETFs) 
were launched that use Al algorithms to choose long-term stock holdings. 


Like other Fintech sectors, Al offers many opportunities and challenges. In terms of financial inclusion, 
the increased application of Al technology to capital markets is likely to reduce barriers to entry for 
many individuals who might not have previously had access to financial markets. Some of the world’s 
most valuable big tech companies such as Apple, Amazon, Tencent and Alibaba have been pouring 
money into Al research. But as Robert Shiller’s remarks at the 2018 Davos Forum indicate, Al also 
presents a great deal of uncertainty as a disruptive technology. 


In 2016, the GIS-Liquid Strategies group was managing $13 billion with only 12 people. In 2017 
Standard & Poor's (S&P) acquired Kensho for $550 million in the biggest Al acquisition to date. Kensho 
was founded in 2013 with the intention of replacing bond and equity analysts. Its algorithm is dubbed 
“Warren” (after Warren Buffet) and it can process 65 million question combinations by scanning over 
90,000 events such as economic reports, drug approvals, monetary policy changes and political events 
and their impact on financial assets. DeepMind Technologies was purchased by Google and Intel has 
acquired Nervana Systems. In 2017 Opimas LLC estimated that Al would result in approximately 
230,000 job cuts in financial firms worldwide by 2025, with the hardest hit area being asset 
management (with an estimated 90,000 job сиїѕ)?. 


A literature survey of Al and financial services cannot ignore the econometric aspects and implications. 
ML methods are about algorithms, more than about asymptotic statistical processes*. Unlike maximum 
likelihood estimation, ML’s framework is less unified. To that end, | will discuss the ML approaches of 
unsupervised and supervised learning. 


In unsupervised learning, ML can help issue account alerts such as low balance warnings. It can also 
be applied to bank overdraft charges to help ascertain what is happening to individual customers and 
what might be the causes of the situation. This is accomplished by using clustering algorithms. 
Regulators can also use clustering algorithms to better understand trades and categorise business 
models of banks in advance of regulatory examinations. 


The SEC is using topic models to detect accounting fraud. Topic models help us understand the 
behavioural drivers of different market participants. Topic models draw on text mining and natural 
language processing (NLP). Both of these unsupervised techniques are precursors to predictive 
analytics (or supervised ML). Supervised ML entails teaching an algorithm to learn from past breaches 
of regulations and predict new breaches, insider trading and cartel detection. In this literature survey | 
will also discuss Random Forests”, neural networks (a type of deep learning), as well as least absolute 
selection and shrinkage operator (LASSO) regressions. 


Additionally, | plan to address the following knowledge gap - ML is anticipated to have a far greater 
potential impact if it is combined with the processing capabilities of quantum computing. If quantum 
computing becomes a reality, it has the potential to disrupt blockchain. Once this is achieved, what 
does this mean for the financial services industry? 


Al continues to become more sophisticated and complex, but so do the financial markets and this 
presents major challenges in regard to regulation and policy-making. Finally, | will discuss how ML is 
making an impact on the tools regulators use to set policy, detect fraud, estimate supply and demand 
and ensure compliance. In the future, regulators will still need to have procedures in place for 
determining whether a firm or person is at fault. | will detail the international regulatory responses to 
Al and financial services, with an emphasis on the UK. | will consider examining various UK agencies 
such as the Bank of England, the FCA, Serious Fraud Office and the Competition and Markets Authority. 


The paper is set out as follows: Section Two describes the taxonomy and historical overview of Al, ML 
and DL. Section Three details the global growth of Al, followed by three examples of how Al is changing 
the financial services industry in Section Four. In Section Five | describe at length the differences 
between various ML techniques and traditional econometric methods. The impact of the emerging 
field of quantum computing on Al is discussed in Section Six. The regulatory response to Al is provided 
in Section Seven and Section Eight concludes. 


2. Taxonomy and historical overview of Al, ML and DL 


The term “artificial intelligence” was coined in 1956 by John McCarthy. The Oxford English Dictionary 
defines Al as “The theory and development of computer systems able to perform tasks normally 
requiring human intelligence, such as visual perception, speech recognition, decision-making and 
translation between languages.”° FSB (2017) defines Al as, “The theory and development of computer 
systems able to perform tasks that have traditionally required human intelligence.” 


Both are fairly broad definitions. Kaplan (2016) describes Al as, “The essence of Al, indeed the essence 
of intelligence, is the ability to make appropriate generalizations in a timely fashion based on limited 
data. The broader the domain of application, the quicker the conclusions are drawn with minimal 
information, the more intelligent the behavior.” 


The historical development of the Al field is presented in the Appendix. It is clear from the timeline that 
early Al efforts concentrated on rules that included logic-based algorithms. Turing (1950) details an 
operational test (the Turing Test) for intelligent behaviour. In his seminal work, Turing provided the 
major components for future Al work with language, reasoning, knowledge, learning and 
understanding. Through the Turing Test, Turing laid the ground work for ML, genetic algorithms and 
reinforcement learning. The attempt to replicate the logical flow of human decision making through 
processing symbols became known as the “symbol processing hypothesis” (Newell, Shaw, and Simon, 
1957; Newell and Simon, 1961, Gilmartin, Newell and Simon, 1976). 


Much of Al in the 1950s and 1960s did not focus on finance applications. In the 1960s, a substantial 
body of work on Bayesian statistics was being developed that would later be used in ML. Neural 
networks (which would become a cornerstone of deep learning) were developed in the 1960s and grew 
rapidly. However, due to a lack of sufficiently available electronic data and computing power, Al fell out 
of favour into what became known as an “Al winter” (Kaplan, 2016; FSB, 2017). The term “Al Winter” 
also connotes a slowdown in investment and interest. In 1973, the UK Lighthill Report ended 
government support for Al research. 


The 1980s witnessed an Al revival due to new funding and techniques. During the 1980s, Japan, the 
UK and the USA competed heavily in Al funding. Japan invested $400 million through the Japanese 
Fifth Generation Computer Project. The UK invested £350 million in the Alvey Program and DARPA 
spent over $1 billion on its Strategic Computing Initiative. In 1982 Al made inroads into the financial 
services industry when James Simons founded quantitative investment firm Renaissance 


Technologies’. 


This included the development of “expert systems” (or “knowledge systems”) which is a technique that 
solves problems and answers questions within a specific context. Brown, Nielson and Phillips (1990) 
provide an overview of integrated personal financial planning expert systems. They emphasise expert 
systems that use heuristics and the separation of knowledge and control as well as providing examples 
of expert systems that were prevalent at the time. For example, PlanPower provided tailored financial 
plans to individuals with incomes over $75,000. 


The Personal Financial Planning System (PFPS) was used by Chase Lincoln First Bank and Arthur D. 
Little Inc. to undertake investment planning, debt planning, retirement planning, education planning, 
life-insurance planning, budget recommendations and income tax planning. 


Expert systems were also used in the stock market in what was known as “program trading.” At the 
time, institutional investors used program trading to capitalise on pricing disparities in the market. 
Finnerty and Park (1987) provide an empirical study of program trading that identifies discrepancies 
between stock index futures and the underlying stock index. They find the program trading strategy 
consistently outperforms the simple execute and hold to expiration strategy. 


However, program trading was often attributed to the wild market swings of the late 1980s, culminating 
in the 508-point drop in the Dow Jones Industrial Average (DJIA) in 1987. Chen and Liang (1989) detail 
PROTRADER (an expert system prototype for program trading) which is based on a learning 
mechanism based on parameter adjustment to several critical parameters based on market conditions. 
Chen and Liang (1989) describe a successful prediction of the DJIA 87 point drop in 1986. 


Even though expert systems have since declined, one notable example that still remains is Fair lsaac 
Corporation's (FICO)? Blaze Advisor business rules management system (Kaplan, 2016). The late 1980s 
witnessed the rise of IBM and Apple desktop computers. As specialised expert systems became more 
expensive to maintain, a second Al winter ensued. This was also driven by companies that had failed 
to deliver on extravagant promises. This second Al winter lasted until approximately 1993. Despite this 
in 1988, David Shaw founded a hedge fund (D.E. Shaw) that was an early adopter of Al techniques for 
trading. 


After the work of Cheeseman (1985) and Pearl (1988) Bayesian analysis gained greater acceptance in 
addressing uncertainty in Al research. The 1960s and 1970s had been dominated by probabilistic 
reasoning models but Bayesian networks combined classical Al and neural nets and allowed for 
learning from experience. 


In the 1990s, the use of Al in fraud detection garnered more interest. The FinCEN Artificial Intelligence 
System (FAIS)? was put into service in 1993 in an effort to predict and assess money laundering 
incidents. Over the following two years, FAIS would review over 200,000 transactions per week and 
identify 400 potential money laundering incidents, worth approximately $1 billion. 


After the second Al winter concluded, Al advanced more into new areas like machine learning, data 
mining, virtual reality and case-based reasoning. ML is considered a subset of Al and uses algorithms 
to automatically optimise through experience with limited or no human intervention. ML is primarily 
derived from sources such as experience, practice, training and reasoning. More specifically, ML is 
concerned with general pattern recognition and universal approximations of relations in data in cases 
where no a priori analytical solution exists (Cybenko, 1989). 


Machine learning is acknowledged to have originated with the work of McCulloch and Pitts (1943). 
They recognised that brain signals are digital in nature, more specifically binary signals. According to 
Chakraborty and Joseph (2017) each ML system comprises five components: (1) a problem, (2) data 
source, (3) a model, (4) an optimization algorithm and (5) validation and testing. 


ML is best suited for situations that require extracting patterns from noisy data or sensory perception— 
or a data-up approach. The four main drivers of the growth in ML include: the transition from physical 
to electronically stored data; improvements in memory and computing speed, easier access due to the 
internet; and low-cost high-resolution digital sensors. 


By 2011, computer processing power and storage were growing at an accelerating rate and 
supercomputers presented new possibilities for the Al field. Deep Learning became the new Al 
breakthrough. FSB (2017) defines “Deep learning” (DL) as “...a form of machine learning that uses 
algorithms that work in ‘layers’ inspired by the structure and function of the brain. DL algorithms, whose 
structure are called artificial neural networks, can be used for supervised, unsupervised, or 
reinforcement learning.” DL is a statistical technique for finding patterns in large amounts of data. It 
uses neural networks which are based on mimicking the way multiple layers of the brain’s neurons 
work (hence the term “deep”). 


Artificial neural networks were first introduced by McCulloch and Pitts (1943) and are architectures 
which are designed to mimic the way the human brain works. The two primary advantages of DL are: 
(1) itis more resilient than ML to overfitting and (2) DL can address non-linear events such as market 
volatility, which usually have to be adjusted manually in standard quantitative models. In 2012 Google 
had two DL projects in place and by 2017 it had over 1,000 DL projects underway (Makridakis, 2017). 


In the last few years, verification in Al has proved to be essential. On August 1, 2012 Knight Capital lost 
$440 million in 45 minutes after deploying unverified trading software. The “Flash Crash” on 6 May 
2010 was noteworthy for another reason. P&G swung in price between a penny and $100,000 the 
problem wasn’t caused by bugs or computer malfunctions that verification could have avoided. It was 
caused by expectations being violated: automatic trading programs from many companies found 
themselves operating in an unexpected situation where their assumptions were not valid. 


In 2013, during a 17-minute computer glitch, Goldman Sachs flooded the US market with orders to 
purchase 800,000 contracts linked to equities and ETFs. During the same week, Chinese brokerage 
firm Everbright Securities, suffered a malfunction which resulted in it purchasing nearly $4 billion worth 
of shares on the Shanghai market.'° After the Brexit referendum in June 2016, Betterment LLC (a new 
robo-advisor) suspended trading in response to market volatility in order to spare its clients higher 
transactions costs. Betterment LLC relied heavily on algorithmic trading and regulators were 
concerned about when and how Betterment LLC might restrict trading (Brummer and Yadav, 2019). 


3. Global growth of the Al industry 


In the last 60 years the Al field has experienced its share of successes and failures. Currently, 
governments around the world are competing to create superior Al facilities and research with a view 
to Al being a lever for greater economic power and influence. Between 2012 and 2016 the US invested 
$18.2 billion into Al compared with $2.6 billion in China and $850 million in the UK"'. The Japanese 
Government Pension Investment Fund (the world’s biggest manager of retirement savings) is 
considering Al to ultimately replace human fund managers. In February 2018, BlackRock announced it 
would establish an Al Іар '?. With $6.3 trillion assets under management, the firm already employs text 
analysis and analyses corporate website traffic and smartphone geolocation data and is now looking 
at ML to deploy in asset management. 


However, the recent trend has been one of rapid growth. According to a Wushen Institute Report 
(2017), 5,154 Al startups have been established globally during the past five years, representing a 175% 
increase relative to the previous 12 years. There are two explanations for this impressive growth. First, 
exponential advances in computing power have led to declining processing and data storage costs and 
secondly, the immense data availability has increased, creating more possibilities in the Al field. 


Historically, the US has dominated the Al industry. Between 2000 and 2016 there were 3,033 Al 
startups in the US, accounting for 37.41% of the worldwide total (Buchanan and Cao, 2018). However, 
the proportion has been decreasing and in 2016 dropped to under 30% for the first time. During the 
same period, the US received $20.7 billion in funding, accounting for 71.78% of the world's total funding 
(Wushen Institute Report, 2017)'°. 


In 2017 China surpassed the US for the first time in terms of Al startup funding (CB Insights, 2018). In 
2012 China accounted for 48% of global Al startup funding and in 2017 the total global Al funding was 
$15.2 billion. Al equity deals increased 141% relative to the previous year and since 2016 more than 
1,100 new Al companies have raised their first round of equity financing. However, the US is losing its 
global Al equity deal share, decreasing from 77% to 50% of equity deal share during the last five years 
(CB Insights, 2018). 


In terms of Al growth, China leads the Asian market. During the past five years China accounted for 
68.67% of Asian Al startups and corresponding Al funding was 60.22% of the Asian total. Many Chinese 
cities and provinces dominate other Asian countries. In terms of the number of Al companies, there 
are 454 in Beijing, 319 in Guangdong and 224 in Shanghai compared with 57 in Singapore and 283 in 
India (Wushen Institute Report, 2017). 


Beijing has attracted $1,387 million in Al funding, followed by Guangdong ($792 million) and Shanghai 
($154 million), the total exceeding that of Japan ($436.81 million). Beijing’s Al current funding is also 
higher than the UK ($1,251 million) (Wushen Institute Report, 2017). Within China growth has been 
swift in the last three years and Beijing accounts for the majority of Al funding (50.23%), followed by 
Guangdong (28.68%) and Shanghai (5.57%) respectively. 


China has overtaken the US for the number of Al patents over the last five years (Buchanan and Cao, 
2018). Between 2012 and 2016 the compound growth rate of Al patents was 33.2% per annum. 
Currently the US and China hold over 50% of all Al patents (35,508 in US and 34,345 in China) but they 
are being filed at a faster pace in China. The US owns 32% of ML patents and 26% of natural language 
processing patents around the globe. In these same areas, China comes second with 23% and 14% of 
patents respectively". 


Currently China dominates the machine vision patent category (55% of 150,000 patents globally). 
Machine vision refers to object and facial recognition and is useful in public security, healthcare, e- 
commerce and autonomous driving. Two Chinese companies—Face# and SenseTime—are valued as 
“unicorns.” SenseTime is the most well-funded Al startup to date, with a valuation of US$4.5 billion, 
making it the highest valued Al company in history. As Figure 1 (below) indicates, China exceeds the 
US іп terms of Al related patent publications (by a factor of more than 5 times) between 2013 and 2017. 


Figure 1 – Al related patent publications, keyword ‘artificial intelligence’. Source: CB Insights, 2018. 
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For ML related patent publications, the gap is closing between China and the US. Another useful 
measure is paper submissions at conferences. In 1980, there was not a single paper from China 
submitted to the Association for the Advancement of Al Conference. However, in 2018, China 
submitted 25%more papers than US authors. '® Additionally, cross border investment is on the rise, but 
not in an equal manner. As Figure 2 (next page) indicates there are more Chinese investments in Al 
startups in the US, than vice-versa. In fact, the gap between the two widened after 2015. 


Figure 2 - Cross-border Al investments are on the rise. Source: CB Insights, 2018 
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So what explains the increasing dominance of China in the Al area? First, there is the matter of scale 
and China has a massive supply of data. China’s online population (730 million) is almost twice the size 
of the US. Many ML techniques require vast amounts of data. WeChat has over a billion users. 
Secondly, there are two prominent technologies fuelling the drive, namely facial recognition and Al 
chips. '° 


Many applications like deep neural networks need to supplement central processing units (CPU) with 
other processors. Al chips draw on graphic processing units (GPU) technology which is then applied 
to Al, ML or DL problems.'” Both the US and China compete heavily іп Al chip technology. Alibaba is 
aiming to have its first Al chips on the market in 2019. Currently, most chips are coming from US 
chipmakers like Qualcomm and Nvidia. 


In addition, Google has created the Tensor Processing Unit (TPU) and Microsoft uses a processor 
called the field programmable gate array (FPGA). Alibaba is aiming to have its first Al chips on the 
market in 2019. Baidu and JD.com invest heavily in Al both domestically and abroad. Chinese firm, 
Horizon Robotics is also entering this area. Backed by Intel, Horizon Robotics raised up to $1 billion in 
a funding round. Horizon’s Al chips run facial recognition algorithms and identify faces from a database 
of up to 50,000 faces. ' The Al chips for Horizon’s self-driving cars and robots are based on chips that 


run pre-trained DL algorithms. '® 


Chinese companies like Tencent, Baidu and JD.com are Fintech leaders in the Chinese market in terms 
of market capitalisation and user numbers. All three companies are investing heavily in Al. 


However, Chinese and American big tech firms have differed in terms of their Al focus. Microsoft, 
Google and IBM focus on ML, speech recognition and speech synthesis whereas Tencent, Alibaba and 
Baidu focus on image recognition and Al searching.2° Chinese company Cambricon is developing 
chips for DL. Ant Financial uses facial recognition for payments at Alibaba owned retail stores. In 2016, 
Ant Financial, Foxconn and the city of Hangzhou partnered for the “City Brain” project using Al data 
from social feeds and surveillance cameras. Additionally, 55 cities participate in the “sharp eyes” 
project whose surveillance data may end up powering the nation’s Social Credit System, a measure to 
gauge “trustworthiness” (CB Insights, 2018). 


Chinese local governments are offering financial incentives to encourage Al related innovations. Baidu, 
Alibaba and Tencent also collect vast amounts of data on what consumers buy, where they travel, and 
who they chat to online. In fact, Baidu has shifted its business strategy from “mobile first” to “Al first.” 
Baidu’s IPO float of iQyi is anticipated to provide more additional resources to invest in Al ventures. 


In July 2017 the China State Council announced a development plan with the goal of being the world 
leader in Al by 2030. The council presented a timeline where it expects companies and research 
facilities to be at parity with the US by 2020.2' In June 2017 the Chinese government announced plans 
to set up an “intelligence industry zone” near Tianjin along with a $5 billion fund to support the Al 
industry. 


In terms of Al in the United Kingdom, a 2017 government report estimates that Al could add an 
additional USD $814 billion (£630 billion) to the UK economy by 2035 (Hall and Pesenti, 2017). In the 
UK a new Al startup has been established on roughly a weekly basis since 2014.22 Swiftkey, DeepMind 
and Ravn are among the more prominent Al companies with London being the main hub of Al startups. 
Bristol company Graphcore is a leading innovator in the chip market. In 2015 BlackRock acquired 
automated investment platform FutureAdvisor and invested £60 million into early-stage UK venture 
capital fund Forward Partners, which targeted Al investment.”’. Hall and Pesenti (2017) identify three 
areas of finance in the UK where Al has great potential: personalised financial planning; fraud detection 
and anti-money laundering; and process automation. 


"Finance is really a wonderfully pure 


information-processing business." 
D.E. Shaw, 199624 


4. How Al is changing the financial services industry 


Outside of the technology sector, the financial services industry is the biggest spender on Al services 
and is experiencing very fast growth (Citi, 2018). Until recently hedge funds and HFT firms were the 
main users of Al in finance, but applications have now spread to other areas including banks, 
regulators, Fintech, insurance firms to name a few. 


Within the financial services industry, Al applications include algorithmic trading, portfolio composition 
and optimisation, model validation, back testing, robo-advising, virtual customer assistants, market 
impact analysis, regulatory compliance and stress testing. In this section, | discuss three specific areas 
in which Al is currently changing the financial services industry, namely (1) fraud detection and 
compliance; (2) banking chatbots and robo-advisory services; and (3) algorithmic trading. 


4.1. Fraud detection and compliance 


As e-commerce has become more widespread, online fraud has also increased. According to the FCA, 
UK banks spend £5 billion a year in combating financial crime.? Action Fraud reports that between 
2015 and 2016 there was a 66% increase in the number of reported cases of payments-related fraud?’ 
in the UK. US banks spend over $70 billion on compliance each year. Many large banks have had 
massive fines imposed upon them for failing to stop illegal financing, and as a result, many banks have 
turned to Al techniques to improve their operations. 


“Benford’s Law” is one of the simplest ways to detect fraud. It is accomplished by running an analysis 
on the first digits in a given set of data. A predictable distribution of first digits will exist in a set of “real” 
data. Benford’s Law has existed since the late 1800s. 


Al is beneficial here because ML algorithms can analyse millions of data points to detect fraudulent 
transactions that would tend to go unnoticed by humans. At the same time ML helps improve the 
precision of real-time approvals and reduces the number of false rejections. Fraud detection now 
involves more than a checklist of risk factors. Using ML techniques, fraud detection systems can now 
actively learn and calibrate in response to new potential (or real) security threats. Using ML, banks’ 
systems can detect unique activities or behaviours (“anomalies”) and flag them for investigation. 


Credit card fraud detection is one of the most successful applications of ML. Banks are equipped with 
monitoring systems, or workflow engines, that are trained on historical payments data. Algorithm 
training, back testing and validation are based on very large datasets of credit card transaction data. 
Classification algorithms can label events as “fraud” versus “non fraud” and fraudulent transactions 
can then be stopped in real time (van Liebergen, 2017). 
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Many financial services companies are exploring Al-based fraud prevention alternatives. Mastercard 
launched Decision Intelligence (DI) technology and its approach is relatively straightforward. Rather 
than staying limited to a predefined set of rules, DI derives patterns from historical shopping and 
spending behaviour of Mastercard customers to establish a baseline. It is against this baseline that 
Mastercard compares and scores each new customer transaction. This is a major improvement over 
traditional fraud prevention technologies, which rely on a generic template to evaluate all transactions. 


NatWest has adopted a ML solution, namely Corporate Fraud Insights from Vocalink Analytics to detect 
and prevent redirection fraud. Redirection fraud occurs when a business is tricked into paying money 
into a fraudster’s account rather than their intended supplier.’ In fighting money laundering HSBC 


uses Quantexa Al software, a UK based start-up.”® 


Fraud is a latent variable, meaning it is not directly observable but must be inferred from data. As a 
result, it is more challenging for ML algorithms to make accurate predictions of possible fraud than 
shopping decisions (a manifest variable), where retailers have access to full transaction histories 
(Baugess, 2017). Additional challenges exist. A self-defeating goal can arise from declining 
transactions too aggressively in order to prevent fraud. 


A Javelin Strategy Report (2015) indicates that transactions wrongly declined due to suspected fraud 
— known as a “false positive” — may represent just as big a threat to the financial services industry. 
This works against the issuer because a false-positive declined transaction can result in erosion of 
customer loyalty. The same report finds that these “false positives” account for $118 billion in retail 
losses and nearly 39% of declined cardholders report that they abandoned their card after being falsely 
declined. 


“Transactions wrongly declined due to 
suspected fraud account for $118 billion in 
retail losses.” 


Javelin Strategy Report, 2015 


As a response, new technology such as EMV (an acronym for Europay, Mastercard Visa and is also 
known as smart or chip cards) and mobile payments has helped improve card authorisation practices 
and reduce false-positive rates (Javelin, 2015). Lopez de Prado and Lewis (2018) examine why false 
positives are so prevalent in the financial services industry. Citi (2018) estimates that ML methods can 
substantially reduce false declines and improve credit card approvals. 


Other ML applications use data sources other than stock prices and trades. In sentiment analysis, ML 
learning attempts to discover new trends and signals in financial activity and then replicate and 
enhance human “intuition.” 


Hoberg and Lewis (2017) use text-based sentiment analysis of SEC 10-K filings and find that fraudulent 
firms produce verbal disclosure that is abnormal relative to strong counterfactual evidence. They find 
evidence that fraudulent managers over purport good performance and disclose fewer details 
explaining the sources of the firm's performance. Cecchini et al. (2010) use support vector machines 
for detecting corporate management fraud using basic financial data. 


In the case of money laundering detection, a large number of false positive results are often created. 
As a result, ML in the anti-money laundering area has proved more challenging. Firstly, there is a lack 
of large public datasets that can be used to analyse money laundering prediction. Many financial 
institutions still rely on conventional rules-based systems which emphasise individual transactions and 
simple transaction patterns which may not be sophisticated enough to detect sophisticated 
transactions (van Liebergen, 2017). Tang and Yin (2005), Liu et al. (2008), Lv et al. (2008) and Villalobos 
and Silva (2017) detail ML applications to anti-money laundering in a foreign context. Purda and 
Skillicorn (2015) and Perols et al. (2017) detect fraud from financial statements using ML techniques. 


Zhang and Trubey (2018) apply five ML algorithms to US money laundering data. These supervised ML 
algorithms include Bayes logistic regression, decision tree, random forest, support vector machines 
and artificial neural networks. Zhang and Turbey (2017) conclude that artificial neural networks 
consistently outperform parametric logistic regressions. For high event rate regimes with sampling, 
support vector machines surpass logistic regressions and random forests delivers comparable 
performance. Clearly ML algorithms provide future possibilities for money laundering detection. 


4.2. Banking chatbots and robo-advisory services 


The unit cost of financial intermediation in the US has remained at approximately 2% for 130 years 
(Philippon, 2015). After the 2008 financial crisis, the cost of financial intermediation has declined only 
marginally in Europe and the US (Philippon, 2016; Bazot, 2013). As part of this post financial crisis 
response, robo-advisors and chatbots are emerging across the financial services sector, helping 
consumers choose investments, banking products and insurance policies. A “bot” is a software 
application created to automate certain tasks using Al technology (Future Today Institute, 2017). A 
robo-advisor is an algorithm based digital platform that offers automated financial advice or investment 
management services. 


The term “robo-advisor” was essentially unheard-of a decade ago, but it is now relatively commonplace 
in the financial landscape. However, the term is misleading and doesn’t involve robots at all. Instead, 
robo-advisors are algorithms built to calibrate a financial portfolio to the user’s goals and risk tolerance. 
Chatbots and robo-advisors powered by natural language processing (NLP) and ML algorithms have 
become powerful tools with which to provide a personalised, conversational and natural experience to 
users in different domains. 


Chatbots and robo-advisors have gained significant appeal with millennial consumers who do not need 
a physical advisor to feel comfortable investing, and who are less able to validate the fees paid to 
human advisors. 
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There are several ways that Al chatbots can improve the banking industry, including helping users 
manage their money and savings. For example, Plum is a chatbot that can be accessed through 
Facebook Messenger and helps a customer save money in small increments. After the initial 
registration, Plum is connected to the customer's bank account and its Al engine then analyses 
customer income and spending history and then predicts how much they can afford to save. Small 
amounts are then deposited to the Plum savings account with periodic reporting. 


“JP Morgan’s [Al technology] can review 
approximately 12,000 documents in a matter of 
seconds...a human would spend 360,000 hours 
on the same documents” 


Brummer and Yadav, 2019 


Banks are also engaging chatbots to improve their self-service interfaces. The Bank of America has 
launched its Al chatbot Erica? and it is available through voice or message chat on the bank’s mobile 
app. Erica’s Al engine also leverages analytics to assist in managing personal finance. JP Morgan has 
invested in COIN, which is an Al technology that reviews documents and extracts data in far less time 
than a human. СОІМ can review approximately 12,000 documents in a matter of seconds, whereas а 
human would spend more than 360,000 hours of work on the same documents (Brummer and Yadav, 
2019). 


Chatbots and conversational interfaces are a rapidly expanding area of venture investment and 
customer service budget. Such chatbots have had to be built with robust natural language processing 
engines as well as reams of finance-specific customer interactions. Natural language processing is 
making it increasingly difficult for bank customers to tell whether they are talking to an Al interface or 
a human. Japan’s three megabanks are using Al and robotics to streamline customer questions.°° For 
example, the Mizuho Group has a robot that helps answer asset management questions and compiles 
documents. 


Amelia is an Al powered chatbot which assists Allstate Insurance employees. Originally deployed in 
September 2017, Amelia has helped call centre representatives with more than 3 million customer 
conversations.*' Amelia is trained іп 40 insurance related topics and uses DL and natural language 
processing as well as data analytics to understand the intent of the user’s text and offer precise 
answers. 


Lemonade is a B2C platform that provides property and casualty insurance to home owners and 
renters. It uses ML and chatbots for customers. On average, it takes 90 seconds to get insured and 
three minutes to get paid for a claim. In China, dialogue robots are an Al technology that are currently 
being utilised in the peer-to-peer (P2P) industry. 


An estimated $200 billion debt mountain has formed in the Chinese P2P industry and since mid-2017 
many lenders have shut down as lending controls have been implemented and additional licenses 
required. 


In 2016, Ziyitong was established and launched an Al platform to help recover an estimated RMB150 
billion in delinquent loans.* The Al platform helps recover delinquent loans for approximately 600 debt 
collection agencies and over 200 lenders (including the Postal Savings Bank of China and Alibaba). The 
Al platform is a dialogue robot which utilises information about borrowers and their friends’ network. 
The dialogue robot uses the information to determine the phrasing with the highest likelihood of 
pressuring the borrower to repay the loan. The dialogue robot will also call the borrower's friends, 
encouraging repayment of the loan. Ziyitong claims its recovery rate is 41% for large clients and loans 
that are delinquent up to one week, a rate that is twice that of traditional debt collection methods. 


Currently the UK and Germany dominate growth in the European robo-advisory industry.°? However, 
the US is still considered a major hub of the robo-advisory sector. There are between 98 and 126 robo- 
advisory services in Europe compared with approximately 200 in the US. Robo-advisors use ETFs in 
stocks or bonds for portfolio construction. A low interest rate environment has been a major driver of 
this growth. As interest from bank deposits have become more negligible demand has grown for 
investment services because robo-advisors provide an inexpensive, convenient platform. 


Robo-advisors have the potential to lower costs and increase the quality and transparency of financial 
advice for consumers. Rohner and Uhl (2017) describe robo-advisory services in three contexts: (1) 
access to and rebalancing of passive and rule-based investment strategies, (2) cost-efficient 
implementation of a diversified asset allocation, and (3) behavioural biases. They find that compared to 
traditional investment advice, robo-advisors can save costs of up to 4.4% per year. 


4.3. Algorithmic trading 


Algorithmic trading (AT) has become a dominant force in global financial markets. Also called 
“Automated Trading Systems,” AT’s origins date back to the 1970's. Kirilenko and Lo (2013) provide a 
brief survey of the evolution of the AT field. Chakravorty (2016) defines AT as: “Algorithmic trading is 
about implementing trading rules into a program and using the program to trade, [and Al trading] can 
be defined as an approach to machine learning that learns the structure of the data, and then tries to 
predict what will happen”. 


Algorithmic trading now involves the use of complex Al systems to make extremely fast trading 
decisions. Computers generate 50-70% of equity market trades, 60% of futures trades and 50% of 
Treasuries (Brummer and Yadav, 2019). Aldridge and Krawciw (2017) estimate the share of market AT 
to be closer to 40%. 


The benefits of AT include (1) the ability of trades to be executed at the best possible prices, (2) 
increased accuracy and a reduced likelihood of mistakes, (3) the ability to automatically and 
simultaneously check multiple market conditions and (4) human errors caused by psychological or 
emotional conditions are likely to be reduced. 


15 


In relation to the second benefit, the European Space Agency's Mosaic Smart Data algorithms are 
being now deployed to mitigate "fat finger" trades.** A fat finger trade is where trader accidentally 
presses the wrong key. Mosaic Smart Data can be deployed to analyse millions of financial trading data 
points and have recently been high profile cases at Samsung and Deutsche Bank. The algorithms are 
also being adopted to detect fraud in the financial services industry. 


Algorithmic trading’s target clientele is hedge funds, proprietary trading houses, bank proprietary 
trading desks, corporates, and the next generation market makers. AT includes making certain trading 
decisions, submitting orders, and managing those orders after submission. Biais, Foucault, and Moinas 
(2011) and Martinez and Rosu (2011) argue that algorithmic speed should have a positive effect on the 
informativeness of prices. Hendershott et al. (2013) find that AT improves liquidity and enhances the 
informational content of quotes. On the other hand, AT may also impose higher adverse selection costs 
on slower trades. 


Algorithmic systems often make thousands or millions of trades per day. The term given to this is HFT. 
HFT is the most recognisable form of AT and uses high-speed communications and algorithms in 
financial market transactions. HFT has both its supporters and detractors. Since 2013, two-thirds of 
the top 30 cited papers on HFTs show positive market effects from HFTs (Das, 2017). There are 
supporting arguments that HFT helps with price discovery and efficiency by trading in the direction of 
permanent price changes and in the opposite direction of transitory pricing errors (Broggard et al. 
2014). These types of trading improve market liquidity. Hendershott and Riordan (2013) find that HFT 
can provide market stability and Menkveld (2014) finds that HFT reduces trading costs. Hasbrouck and 
Saar (2013) provide evidence that HFT improves market quality and reduces bid-ask spreads. In fact, 
HFT is changing the traditional field of market microstructure and will continue to be reinvented 
through new Al and DL techniques. 


Most hedge funds and financial institutions do not openly disclose their Al approaches to trading (for 
proprietary reasons), but it is believed that ML and DL play an important role in calibrating real-time 
trading decisions. It also involves neural networks, fuzzy logic and pattern recognition. There are four 
common AT strategies that are now briefly described. Signal processing is a mathematical extension 
of technical analysis based on the art of filtering to eliminate noise and discern trading patterns. 


Secondly, there is a strategy is known as market sentiment. In this strategy the computer is entirely 
unaware of market activity until it is fed model market data flows and then the algorithm becomes 
aware of market agitation and participant activity. The objective of market sentiment is to provide the 
algorithm the appropriate context to analyse and learn market psychology of supply and demand. Third, 
there is an AT strategy known as news readerwhich does not react to major political events unless it 
is taught how to artificially read news headlines. Finally, the AT strategy known as pattern recognition 
enables a machine to learn, adapt and react when patterns arise creating revenue opportunities. 


US based Sentient Technologies is an Al company that operates a hedge fund and has developed an 
algorithm that processes millions of data points to find trading patterns and forecast trends. Based on 
trillions of simulated trading scenarios, Sentient’s algorithms use those scenarios to identify and blend 
successful trading patterns and devise new strategies. 


These techniques enable Sentient Technologies to reduce 1,800 days of trading to just a few minutes. 
Sentient calls its successful trading strategies “genes,” which are then tested in live trading. Here the 
genes evolve autonomously as they gain experience. Kensho’s algorithm processes millions of market 


data points to discover correlations and investment opportunities. 36 


Another hedge fund, Numerai also uses AT to make trading decisions. Instead of developing the 
algorithms internally, Numerai outsources algorithm development to thousands of anonymous data 
scientists, who compete to create the best algorithms. In turn, they win cryptocurrency for their efforts. 
The way in which Numerai shares trading data with the scientists prevents scientists from replicating 
the fund’s trades while allowing them to build models for better trades. In December 2017, ING 
launched an Al bond trading tool, Katana.°” Katana shows human traders at the emerging market desk 
how to gather bond prices more rapidly. After a six-month trial, Katana led to cutting costs by 25% and 
faster pricing decisions 90% of the time. UBS and JP Morgan have already introduced Al into their 
trading tools. JP Morgan uses Al algorithms to execute equity trades and UBS has used Al techniques 


to trade volatility®®. Al algorithms are also being used to guide venture capital investments.°?. 


The practice still has many sceptics, especially with traditional traders who are dubious about the lack 


40” nature of Al algorithms. Yadav (2015) states that algorithms used in 


of transparency and “black box 
markets possess “model risk” or the risk that algorithmic programming may not accurately represent 
the world as it is. Pasquale (2015) states that one drawback is that high frequency traders mimic what 
other traders are doing without exploring the underlying value of the company whose shares are being 
traded. AT can also potentially freeze markets and create instability when trading strategies interact in 


unforeseen ways. 


HFT for example has been blamed for the 6 May 2010 “Flash Crash”*! and is described in Kirilenko et 
al. (2017), Kirilenko and Lo (2013) and Lewis (201 4). On 6 May 2010, the S&P500 plummeted more than 
8%, before quickly rebounding. It was described as the first market crash in a new era of automated, 
algorithmic trading. During the 2016 Brexit referendum, there was a flash crash of the pound sterling 
by approximately 6% in two minutes.” In 2015, algorithms based on volatility quantitative strategies 
were attributed to the August market turmoil.** After market opening on 24 August, the S&P500 
triggered circuit breakers nearly 1300 times. 


Most empirical research indicates that evidence-based algorithms more accurately predict the future 
than do human forecasters. What about “algorithm aversion?” This is when choosing between a human 
forecaster or a statistical algorithm, a person will often choose the human forecaster. Dietvorst et al. 
(2016, 2015) document that people are especially averse to algorithmic forecasters after seeing them 
perform, even when they see them outperform a human forecaster. This is because people more 
quickly lose confidence in algorithmic forecasters compared to human ones after seeing them make 
the same mistake. 


Through an experiment Dietvorst et al. (2016) determine that participants’ preference for modifiable 
algorithms is indicative of a desire for some control over the forecasting outcome, and not for a desire 
for greater control over the forecasting outcome. So, participants would prefer an algorithm they could 
modify. 
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O'Neill (2017) addresses the potential biases in AT. Kirilenko and Lo (2013) recognise the 
interconnectedness of global financial markets and the challenges and unintended consequences that 
brings with it. They list the following unintended consequences: flash crashes, fire sales, careless IPOs, 
cyber security breaches, and catastrophic trading errors. They call for a more adaptive and systematic 
approach to regulating AT, where the technological advances of the industry are addressed whilst 
protecting those who are not as technologically advanced. A UK House of Lords report in April 2018“ 
suggests that the Al sector's full potential would only be realised if potential risks such as algorithmic 
bias and the opaqueness of "black box" systems can be mollified. 


4.4. Other applications of Al 


Today, ML has come to play an integral role in many aspects of the financial ecosystem, from approving 
loans, to managing assets, to assessing risks. The global real estate industry is worth over $200 trillion 
(Barnes, 2016) and has been significantly transformed by these new technologies. Proptech is the term 
that refers to the new and emerging technologies that are disrupting real estate markets. Al is being 
used as part of the new real estate business models. 


Founded in Germany, Leverton is an Al powered data extraction platform that uses DL algorithms to 
automatically extract key information out of documents such as rental leases, break options and overall 
clauses. Structured data can be easily accessed from a platform that is understandable in 20 
languages.*° The platform also has a traceable audit between the underlying documentation and 
structured data output. In Singapore proptechs use Al to derive formulae calculating the value of a 
property using a mix of comparative market analysis and algorithms, instead of through manual 


valuation.*6 


Al algorithms can also be applied to corporate governance contexts. Erel et al. (2018) use ML 
algorithms to select company directors based on performance. The ML algorithm “learns” from the 
way directors were selected in the past. They find ML has the potential to improve corporate 
governance practices because the directors predicted to perform poorly by algorithms, perform much 
worse than those predicted to perform well. Routledge et al. (2018) investigate phrases associated with 
mergers and acquisitions (M&A) deal incidence and recent performance. Li et al. (2018) use 
unsupervised ML to capture corporate culture and its role in M&A activity. 


The loan and insurance underwriting business is an ideal environment for ML in finance. At large banks 
and publicly traded insurance firms ML algorithms can be trained on millions of consumer data items 
(age, job, marital status, etc.) and financial lending or insurance results (has the person defaulted, paid 
back the loan on time, been in a car accident, etc.). Using mobile phone data, Bjorkegren and Grissen 
(2015) employ ML methods to predict loan repayments. 


The underlying trends that can be assessed with algorithms, and continuously analysed to detect 
trends that might influence lending and insuring into the future (are more and more young people ina 
certain state getting in car accidents)? In the insurance industry, Cytora is using Al to make better risk 


assessments about their customers, leading to more accurate pricing and minimising claims.*’ 
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“Machine learning promises to shake up large 


swathes of finance.” 
The Economist, 2017 


5. Econometrics versus ML 


The goal of both statistics and ML methods is to learn from data (Frame et al., 2018). However, ML 
methods are not guided by economic theory and are more about algorithms, rather than about 
asymptotic statistical processes. Traditional statistics highlights hypothesis testing and inference, 
whereas ML methods emphasise obtaining the best prediction. Unlike maximum likelihood estimation, 
ML’s framework tends to be less unified. To-date there have been relatively limited data intensive 
applications of ML and DL in the finance literature. However, the increase in processing power, the 
emergence of big data, better algorithms and growth in Fintech after the 2008 financial crisis have led 
to an increase in ML and DL techniques and applications. 


Most fields (including finance) have traditionally employed models like linear regression where the 
curve fit to the data is usually a straight line (Domingos, 2017). However, most data tends to exhibit 
nonlinearity. Several ML methods are able to infer non-linear relationships. The key difference between 
ML and conventional econometric analysis is its larger focus on prediction compared to summarisation 
and causal inference (Varian, 2014). ML emphasises “high dimensional prediction problem” and 
traditional statistics emphasises “formal statistical inference (confidence intervals, hypothesis tests, 
optimal estimators) in low dimensional problems” (Wall, 2017). Because of this, ML models are not 
evaluated on the basis of statistical tests, but on their out-of-sample prediction performance. 


This means that a ML model describes situations it has not seen before. In order to create a powerful 
out-of-sample prediction, the model is created by running variables and the model on data subsamples 
to identify the most powerful predictors. Then testing is conducted (thousands of times) on different 
data subsamples. This is done so that the model can learn from the data and improve its predictive 
performance (van Liebergen, 2017). As an example, Khandani et al. (2010) apply ML methods to 
consumer credit risk models and are able to construct out-of-sample forecasts that significantly 
improve the classification rates of credit-card-holder delinquencies and defaults. They estimate cost 
savings ranging from 6% to 25% of total losses. 


There are some drawbacks to ML and DL methods. One drawback of ML is that one may struggle to 
explain why a model is doing what it does, commonly known as the “black box” criticism. The process 
by which DL techniques reach decisions is also unclear. Deep Learning techniques provide 
predictions, but they do not provide insight into how the variables are being used to reach these 
predictions (Wall, 2017). This is especially important for trying to prevent discrimination in lending 
models (which has been studied in the peer-to-peer lending literature). 
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Overviews of many ML techniques are provided by Varian (201 4), Einav and Levin (2013), Mullainathan 
and Spiess (2017), Chernozhukov, Chetverikov, Demirer, Duflo, Hansen and Newey (2017), and 
Chernozhukov, Chetverikov, Demirer, Duflo, Hansen, Newey and Robins (2017). Athey (2015) details 
how ML relates to causal inference. Many of these reviews provide more technical discussions of 
various ML methods. ML algorithms are categorised as either supervised learning or unsupervised 
learning. 


The goal of supervised learning is to make predictions by learning patterns from labelled data, whereas 
in unsupervised learning the aim is to model the underlying structure or distribution in the data to learn 
more about the data. Generally, ML can also be employed in three types of statistical problems: 
clustering, classification and regression. Clustering is classified as unsupervised ML and regression 
and classification are considered supervised ML Figure 3 (next page) illustrates the distinction 
between unsupervised and supervised learning methods. 


5.1. Unsupervised machine learning 


Unsupervised ML is defined as a system that learns without needing human assistance and “labelled” 
data to teach it. The algorithm aims to find structure in the data, usually by grouping observations, or 
clusters. Clustering algorithms, dimension reduction, feature extraction and topic models (including 
natural language processing and text mining models) are examples of unsupervised ML methods. 
Descriptive techniques such as clustering are often used to gain insights into the mechanisms of 
certain parts of the financial markets. Unsupervised learning can potentially be quite useful in 
applications involving text, images, or other very high-dimensional data (Athey and Imbens, 2017). 


5.1.1. Clustering algorithms 


Clustering algorithms are used to understand consumer and trader behaviour. One example is bank 
overdraft charges in order for bankers to get a better understanding of what is happening to these 
individuals and what might be the causes of the situation. Regulators can use clustering algorithms to 
understand traders better (or even insider trading) and categorise business models of firms in advance 
of regulators’ visits. 


5.1.2. Topic models 


Topic models help us understand the behavioural drivers of different market participants. The area can 
be further subdivided into natural language processing and textual analysis. One of the earliest 
applications of ML at the US SEC was to use text mining and natural language processing to detect 
accounting fraud (Baugess, 2017). Tetlock (2007) uses a popular daily Wall Street Journal column to 
examine the interactions between the media and stock prices. He finds that high media pessimism can 
predict downward pressure on market prices which followed by a reversion to fundamentals. He finds 
unusually high or low pessimism predicts high market trading volume. 
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Figure 3 - Machine learning taxonomy 
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5.1.2. Topic models (cont.) 


Tetlock et al. (2008) use a dictionary-based approach and examine whether language measures can 
be used to predict individual firms' accounting earnings and stock returns. They find that media content 
captures otherwise hard-to-quantify aspects of firms' fundamentals, and that investors quickly 
incorporate this into stock prices. Using ML methods applied to internet messages about stocks, Sinha 
(2010) argues that the market underreacts to the tone of news articles. 


Li (2008) classifies the tone of forward-looking statements, showing its association with firm 
characteristics using a naive Bayes algorithm. Manela and Moreira (2017) construct a text-based 
measure of uncertainty starting in 1890 with WSJ front-page articles. They measure news implied 
volatility (NVIX) peaks during stock market crashes, times of policy-related uncertainty, world wars, and 
financial crises. Between 1890 and 2009 a high NVIX predicts high future returns in normal times and 
rises just before transitions into economic disasters. 


Guo et al (2016), Kearney and Liu (2014), Loughran and McDonald (2016) provide a surveys of textual 
analysis applications in the accounting and finance literature. They observe the challenges of tonal 
classification to be the use of slang, sarcasm, emoticons and the constantly changing vocabulary on 
social media. Bao and Datta (2014) use advanced textual analysis to investigate topics in Form 10-Ks 
related to risks (10-Ks are annual reports required by the SEC). Huang et al. (2018) use this approach 
to quantify the content of analyst reports and the role of information discovery and interpretation. 


Antweiler and Frank (2004) examine whether internet stock message boards can move markets by 
studying Yahoo! Finance and Raging Bull messages about the companies in the DJIA and the Dow 
Jones Internet Index. They find that stock messages help predicts market volatility. However, whilst the 
effect on stock returns is statistically significant, the economic significance is small. Renault (2017) 
examines the microblogging platform StockTwits and provides empirical evidence that online investor 
sentiment helps forecast intraday stock index returns. 


5.2. Supervised machine learning models 


Supervised learning is the case where the chosen algorithm tries to fit the target using the given input 
features. A set of “training data” that contains labels on the observations is supplied to the algorithm. 
For example, the labels could be solvent/insolvent. Based on a massive set of data, the algorithm will 
“learn” a classification rule that it will use to predict the labels for the out of sample observations. For 
supervised learning methods the test sample might have 10% of observations (Athey and Imbens, 
2017). 


The difference between supervised and unsupervised ML is that the latter lacks labelled training data 
and has to determine correlations by itself. Examples of supervised learning include predictive 
analytics, random forests, neural networks and LASSO techniques. Introduced by Tibshirani (1996), 
LASSO stands for least absolute shrinkage and selection operator. Belloni et al. (2014) provide a very 
thorough discussion of how LASSO techniques can be applied to high dimensional data. 
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Reinforcement learning has its roots in the pioneering work of Bellman (1957) on optimal control and 
falls between unsupervised and supervised machine learning (FSB, 2017). In reinforcement learning 
the connection between actions and rewards is unknown and must be inferred from the agent's 
interactions with the environment. An unlabelled set of data is fed to an algorithm and then the 
algorithm chooses an action for each data point. A human may then provide feedback (the supervised 
part) that subsequently helps the algorithm learn. Corazza and Bertoluzzo (2014) and Ritter (2017) apply 
reinforcement learning to dynamic trading strategies. 


5.2.1. Predictive analytics 


Both clustering algorithms and topic models are precursors to predictive analytics, a form of 
supervised machine learning. This is where one teaches an algorithm to learn from past breaches of 
regulations and to detect new breaches in regulation. This has potential applications in cartel detection 
and insider trading. 


5.2.2. Random forests 


The random forest model was developed by Breiman (2001). The technique is based on decision tree 
models and is also known as generalised classification and regression trees (CART). Breiman (2001), 
Booth et al. (2014) and Bagherpour (2018) provide a detailed technical overview of random forest 
methodology. The random forest model applies the bootstrap aggregating technique to tree learners. 
The random forest algorithm is extremely useful for forecasting as well as allowing classification on 
whether a variable is more or less important when constructing the decision tree. 


Yeh et al. (2014) find that random forest techniques are very robust and allow for the presence of 
outliers and noise in the training set. JPMorgan researchers consider that random forest shows 
promise for trading 10-year US Treasury market instruments. Medeiros et al. (2017) recognise the 
nonlinearity property with inflation dynamics and compare 16 ML methods against benchmark 
statistical models. In general, ML methods with a large set of covariates provide superior results 
compared to univariate benchmarks and factor models. Medeiros et al. (2017) find that random forests 
is the best model indicating a degree of nonlinearity in the dynamics of inflation. Using ML applications 
in bankruptcy prediction, Barboza et al. (2017) find that random forest techniques outperform other 
methods. 


5.2.3. Neural networks 


In 2012 the full potential of neural networks (NN) applications in finance first came to wider attention. 
NN are a form of DL methodology. Das (2017) observes that DL has been proven to uncover subtle 
nonlinearities in data that are not discoverable using standard linear econometric models. General NN 
consist of three types of layers: input layer, a hidden layer and output layer, and each layer also consist 
of multiple units. Lacher et al. (1995) use NN to assess corporate financial health. They find that NN 
overcome the limitations of multi-discriminant analysis (MDA) such as multivariate normality, linear 
reparability and independence of the predictive variables and achieve superior results. 


23 


Altman et al. (1994) compare artificial neural networks (ANN) with traditional statistical methodologies 
including logit analysis and discriminant analysis. They analyse the financial health of 1,000 Italian 
industrial firms between 1982 and 1992 and find NN display acceptable classification and holdout 
sample accuracy. Using S&P data between 1970 and 1989, Coats and Fant (1999) find that NN are 
more effective than MDA for early detection of financial distress and identify firms which receive 
opinions regarding whether the firm is still a going concern. 


Other papers that show NN techniques are at least as good as MDA methods include: Williams (1985); 
Odom and Sharda (1990); Utans and Moody (1991) when applied to bond ratings predictions. NN 
performs well with regard to limited availability of data and by the lack of complete a priori information 
that could be used to impose a structure to the network architecture. Garavaglia (1991) also looks at 
bond ratings using NN. Salchenberger et al. (1992) compare NN with MDA techniques using thrift 
failure applications. They find that NN require fewer assumptions and achieve a higher degree of 
prediction, accuracy and is more robust than traditional statistical methods. 


Tama and King (1992) compare NN methods with linear classifiers and logistic regression by applying 
these to Texan bankruptcies. Angelini et al. (2008) apply NN to model credit risk for Italian small and 
medium sized enterprises and find that NN methods perform very well. Hutchinson et al. (1994) use 
NN algorithms and find it is an excellent framework to model options pricing functions. Amilon (2003) 
employs daily call option pricing data to examine whether NN can outperform the Black-Scholes 
formula. NN are applied to out of sample pricing and delta hedging and Amilon (2003) finds NN 
outperform the Black-Scholes formula both in hedging and pricing performance. 


Culkin and Das (2017) employ a fully-connected DL NN to reproduce the Black and Scholes (1973) 
option pricing formula. First, Culkin and Das (2017) start with the application of learning the Black- 
Scholes option pricing model from simulated data. The next step is that market data is used to train an 
option pricing model that performs better in terms of predictive accuracy, adaptability and robustness, 
which is their main result. This is achieved with a high degree of accuracy. Li et al. (2018) employ DL— 
specifically, a method that uses layers of ANN to automatically process earnings calls and score 
cultural values for companies. Their ML approach yields a high-quality culture dictionary useful for 
measuring corporate culture which is also scalable to a large collection of textual data. 


5.2.4. Support vector machine (SVM) 


Support vector machines are related to and contain elements of ML, NN and non-parametric applied 
statistics. Bagherpour (2018) provides an excellent detailed technical discussion of the SVM 
methodology. Cecchini et al. (2010) use SVMs for detecting corporate management fraud using 
financial data. Société Générale, the French bank, on the other hand, says support-vector machines 
are better, over time, for making long/short equity investment decisions. Hardle et al. (2008) employ 
smooth SVMs to predict firm default risk. Hardle, Moro and Hoffmann (2011) find SVMs are capable of 
extracting the necessary information from financial balance sheets and then to predict the future 
company solvency or insolvency. 
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6. Machine learning versus quantum computing 


Quantum computing (QC) is a relatively new field of research that studies the algorithms and systems 
that apply quantum phenomena to complex problems. QC can potentially process data at speeds that 
are impossible for traditional computers. Traditional computers only process information in a binary 
format: in zeroes or ones. Benioff and Feynman established the field in the early 1980s and noted that 
whereas digital computers (DC) could not efficiently simulate a probabilistic system, but QC could. 


QCs are much better suited than DCs to solve financial problems because QCs operate with random 
variables, whereas DCs just simulate random variables. QC can hold multiple states simultaneously 
and these co-existing states are called qubits. Qubits are able to process four values at any given time 
and allows the computer to parallel process information (Future Today Institute, 2017). 


The fact that qubits are memory elements may hold a linear superposition of both states is a game 
changer for the financial services industry. Rolling IT security is one of the most famous applications 
of QC. Rosenberg et al. (2015) apply QC to portfolio optimisation problems. Lopez de Prado (2016) 
suggests that QC has potential with finance problems that involve scenario analysis or option pricing. 


In the case of scenario analysis, QC can evaluate an extremely large number of outcomes that have 
been generated at random and for option pricing QC can evaluate a large number of paths that can be 
computationally expensive. Looking ahead, ML is expected to have a far more powerful impact if it is 
combined with QC capabilities. Lopez de Prado (2016) describes how QC has potential to more 
efficiently refine clustering algorithms. 
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7. Regulation and policy-making 


Al and ML are moving faster than policy makers can understand to the extent it is almost outstripping 
the current legal and regulatory framework. Technology is opaque and fast moving and regulators find 
it hard to keep pace, for both the cumulative impact and risks of contagion. Athey and Imbens (2017) 
and Mullainathan and Spiess (2017) argue that ML methods hold great promise for improving the 
credibility of policy evaluation. 


The technology underpinning Fintech is also fuelling a spinoff field known as RegTech which aims to 
make compliance and regulatory activities easier, faster and more efficient. RegTech utilises Big Data 
and ML. RegTech is an emerging field to reduce costs and increase effectiveness. Alarie, Niblett and 
Yoon (2016) explore how ML technology can improve regulation of human behaviour. They argue that 
ML techniques can provide fast, accurate and consistent judgements, and streamline operations with 
reduced error. Baracos and Selbst (2016) examine how fairness and non-discrimination can be 
implemented in learning programs used in banking, insurance and the justice system. 


One example of RegTech can be found at the Hong Kong Stock Exchange which is employing Al 
software used by NASDAQ to detect stock manipulation and market abuses. The software analyses 
historical trading activity to identify patterns and anomalies, including dramatic swings in stock prices 


and rises in trading volume.*® 


Financial regulators are also exploring the use of Al for better monitoring of financial institutions. The 
UK Financial Conduct Authority (FCA) is examining the possibility of making its handbook machine- 
readable and then fully machine-executable. This would mean that machines can interpret and 
implement the rules directly (Citi, 2018). The Division of Economic and Risk Analysis (DERA) at the SEC 
is exploring ML to extract actionable insights from massive datasets, helping examiners find cases of 
potential fraud or misconduct (Baugess, 2017). The Monetary Authority of Singapore is examining how 
Al/ML methods can be used to identify suspicious transactions. This will allow Singapore regulators 
to better devote resources to higher risk transactions (FSB Report, 2017). 


International regulators utilise Al-supported analytical methods to recognise vulnerability patterns, 
scan lengthy reports or analyse incoming data. In 2017 the Bank of England (BoE) joined forces with 
MindBridge to use an Al auditor to help detect anomalies in transactions and reports. The BoE will be 


utilising ML to achieve these goals.“ 


Chakraborty and Joseph (2017) outline how ML methodologies can be applied in the context of central 
banking. They provide three ML case studies (two supervised learning cases and one unsupervised 
learning case) to compare ML methods against traditional econometric methods. They examine ANN, 
tree-based models, support vector machines, recommender systems and different clustering 
techniques. 


Chakraborty and Joseph (2017) conclude that ML methods generally outperform traditional modelling 
approaches in prediction tasks. However, as regards causal inference properties, open research 
questions remain. 


26 


In 2018, Chancellor Angela Merkel announced that the German government would spend ЄЗ billion to 
boost Al capabilities. The Deutsche Bundesbank is already using Al in its risk management area and 
uses NN to assess financial market soundness. On 25 May 2018 the General Data Protection 
Regulation (GDPR) came into force. Under GDPR, EU citizens have the right to receive an explanation 
for decisions based solely on automatic processing. Under recital 71 of the GDPR, “the data subject 
should have the right not to be subject to a decision, which may include a measure, evaluating personal 
aspects relating to him or her which is based solely on automated processing and which produces 
legal effects concerning him or her or similarly significantly affects him or her, such as automatic 
refusal of an online credit application or e-recruiting practices without any human intervention.” 


Furthermore, GDPR stipulates that companies must first obtain consent from an EU citizen before 
using consumer data. If the EU citizen's data is stored on servers located outside of the EU region, 
GDPR rules apply. Failure to comply to GDPR can result in substantial fines: up to $22 million or 4% of 
a company’s revenues (Deloitte, 2018). 


The European MIFID 159 (which also came into effect in 2018) requires that firms applying algorithmic 
models based on Al and ML should have a robust development plan in place. Firms need to ensure that 
potential risks are included at every stage of the process (Wuermaling, 2018). In February 2018 the FCA 
and Prudential Regulatory Authority released consultation papers on algorithmic trading which lists 
key areas of supervisory focus in relation to MIFID II. 


Prior to the 2008 financial crisis, ratings agencies used algorithmic assessments of creditworthiness 
to rate mortgage backed securities (MBS). The 2010 Dodd Frank Act requires ratings agencies to 
disclose material changes in ratings agencies methodologies. The US SEC has been testing ML 
applications in regulatory settings (Bauguess, 201 7). Early use of Al at the SEC were simple text analytic 
methods used to determine if CDS risks in the run up to the global financial crisis could have been 
anticipated (Baugess, 2017). The first mention of a CDS contract in a US 10K filing was in 1998. By 
2004, more than 100 corporate issuers mentioned their use in 10Ks. A large increase in disclosures 
occurred in 2009 after the crisis was in full swing. Even though this early attempt did not work as 
planned, it was an early example of proof of concept and structuring certain phrases. 


The next stage the SEC pursued was to use natural language processing efforts to measure the 
probability of certain words in SEC documents (or unsupervised learning). Topic models were deployed 
to analyse the information in the tips, complaints, and referrals received by the SEC. Additionally, 
tonality analysis is employed to gauge the negativity of a filing by counting the appearance of certain 
financial terms that have negative connotations. The objective is to learn whether themes can be 
classified directly from the data itself that would enable more efficient allocation of resources. 


In 2013 the SEC disclosed that they were using a natural language processing model to detect 
accounting fraud. The SEC now employs a Corporate Issuer Risk Assessment Program which provides 
230 metrics to staff in identifying fraud and misconduct (Hunt, 2017). Now for many cases the SEC 
utilises a two-stage process: unsupervised learning methods to identify general themes and then 
supervised ML algorithms to predict the presence of idiosyncratic risks. Baugess (2017) observes that 
back testing of the ML methods performs very well. 
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The UK Serious Fraud Office (SFO) includes Al in its work. In a typical year the SFO processes over 100 
million documents in fraud and corruption cases. This includes the Rolls-Royce bribery case, which 
resulted in the largest ever fine imposed in the UK for criminal сопаисї.5' The SFO used the RAVN 
robotic system, which ended up costing £50,000, and saved UK taxpayers hundreds of thousands of 
pounds. RAVN is referred to as a Legal Professional Privilege (LPP) robot. RAVN sifts documents into 
“privileged” versus “non-privileged” piles, indexes and compiles summaries. In the Rolls-Royce case, 
RAVN processed 30 million documents at a rate of up to 600,000 per day (compared with a team of 
lawyers that would have processed 3,000 per day). The SFO is also considering other advanced ML 
techniques in other aspects of its operations. 


Wuermaling (2018) provides six warnings for central bankers in regard to Al: (1) Don’t miss out on 
opportunities of Al in finance; (2) beware of the risks (3) consumers should take care: they remain risk 
takers (4) Fintechs should not ignore the legitimate concern of society and supervisors (5) Al needs 
new forms of supervision and (6) central banks should embrace Al. 
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“The rise of powerful Al will be either the best 
or the worst thing ever to happen to humanity. 
We do not yet know which.” 


Stephen Hawking 


8. Conclusion and directions for future research 


Artificial intelligence in the financial services industry is still in early days. Al will become more 
ubiquitous in finance, and with that comes more challenges including legal, ethical, economic and 
social hurdles. Al will also continue to bring new complexities to the global financial ecosystem. As 
more and more data become available and computing power increases, Al programs will become more 
complex. But are Al, ML and DL so different from previous advancements they could upend the laws 


of finance? 52 


A fascinating 2018 Financial Times article raises this question. For example, in 2017, depreciation 
represented approximately one-third of global operating cash flows. But what happens to a computer 
with Al that actually becomes smarter over time? Machine learning by definition is the opposite of 
аесау.°? So if an asset grows more valuable with use then it should be negatively depreciated. This 
means for certain technology firms the earnings boost would potentially be quite large. Artificial 
intelligence combined with the internet of things (loT) will result in physical things becoming more 
adaptive and responsive which will extend their useful lives. 


Along with big data, Al is viewed in the financial services sector as a technique that has the potential 
to deliver huge analytical power. Yet many risks still need to be addressed. Many Al techniques remain 
untested in financial crisis scenarios. There have been several instances in which the algorithms 
implemented by financial firms appeared to act in ways quite unforeseen by their developers, leading 
to errors and flash crashes (notably the pound's flash crash following the Brexit referendum in 2016). 
Lo (2016) calls for developing more robust technology capable of adapting to human foibles so that 
users can employ these tools safely, effectively and effortlessly. 


Much remains to be done. And there is clearly a need for more education on Al literacy and 
awareness™. The late Stephen Hawking summed it up: “The rise of powerful Al will be either the best 
or the worst thing ever to happen to humanity. We do not yet know which.” 
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Appendix 


Timeline of artificial intelligence milestones 

1937 Claude Shannon proposes that Boolean algebra can be used to model electronic circuits 
1943 McCulloch & Pitts recognise that Boolean circuits can be used to model brain signals 
1950 Alan Turing develops the Turing Test 

1950 Minsky and Edmonds build the first neural network computer (the SNARC) 

1956 The term “artificial intelligence’ is coined by John McCarthy 

1956 Newell and Simon create the Logic Machine 


1957 Economist Herbert Simon predicts that computers would defeat humans at chess within the 
following decade 


1958 Frank Rosenblatt introduces а new form of neural network known as “perceptron” 
1958 Early genetic algorithms experiments 


1959 Arthur Samuels demonstrates that a computer can play checkers better than its creator, 
and even play against itself to practice 


1961 Newell and Simons creates General Problem Solver 
1964 Computers understand natural language enough to solve algebraic and word problems 
1965 Herbert Dreyfus’ report severely criticises the emerging Al field 


1967 Marvin Minsky predicts that within a generation the problem of creating “artificial 
intelligence” would be solved 


1969 Bryson and Ho develop а back propagation algorithm 

1971 Terry Winograd’s program SHRDLU answers questions in natural language 
1973 UK Lighthill Report ends British government support for Al research 
1974-1980 First “Al Winter” 

1980 Expert Systems, or Knowledge Systems, emerge as a new field within Al 
1980s Early part of decade - Benioff and Feynman create Quantum Computing 
1982 PlanPower is conceptualised by Applied Expert Systems (APEX) 

1982 James Simons starts quant investment firm Renaissance Technologies 
1984 American Association for Al coins the term “Al Winter” 


1987 Personal Financial Planning System (PFPS) used by Chase Lincoln First Bank 
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1987 - 


1988 


1990s 


1990 


1993 


1997 


2005 


2007 


2009 


2010 


2012 


2014 


2016 


2017 


2017 


2018 


2018 


2018 


2018 


2018 


2018 


2018 


1993 Second “Al Winter” 

David Shaw founds D.E. Shaw and is an early adopter of Al among its hedge funds 
The Al industry shows renewed interest in neural networks 

Neural net device reads handwritten digits to determine amounts on bank cheques 
FinCen puts FAIS (its Al system) into service to monitor money laundering 


Deep Blue defeats Garry Kasparov, world chess champion at the time. IBM's stock price 
increases by $18 billion 


The DARPA 132-mile challenge sees Al applied to autonomous driving 

The DARPA Urban Challenge 

Google's first self-driving car 

Flash Crash occurs on 6 May. In 36 minutes, the S&P crashed 8%, before a rebound 


On 1 August, Knight Capital loses $440 million 45 minutes after deploying unverified 
trading software 


Man Group starts to use Al to manage client money 


Google’s DeepMind AlphaGo applies ML algorithms to win at international Go 
championship 


Two Sigma hedge fund which uses ML, crosses the $50 billion in assets under 
management 


Beijing announces plans to lead the world in Al by 2030 
UBS announces development of recommendation algorithms 
The Merkel government announces ЄЗ billion will be spent on Al capabilities 


President Macron announces that all algorithms developed for government use will be made 
publicly available 


Alibaba announces plans to bring Al chips to market the following year 
MiFID II takes effect 
GDPR takes effect on 25 May 


Baidu becomes the first Chinese tech giant to join a US led consortium on Al safeguards 
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